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CRYSTALT>OftRAPHTC  STUDIES  OF  THE  ANTHRAX  LETHAL  TOXIN 


TNTRODUCTTON 

1.  Background 

Anthrax  is  primarily  a  disease  of  herbivorous  animals,  particularly  sheep  and 
cattle  (1).  Humans  may  acquire  the  disease  from  infected  animals,  typically  as  a 
cutaneous  infection  characterized  by  black  pustules  (2).  The  pulmonary  infection 
results  from  the  inhalation  of  anthrax  spores,  as  a  result  of  handling  contaminated 
animal  hides  or  from  the  release  of  man-made  stores  (16,17),  and  can  lead  to  death 
within  days  (3-5).  The  causative  agent  of  anthrax  is  the  gram-positive,  aerobic, 

spore-forming  Bacillus  antliracis  (6).  The  pathogenesis  of  B.  anthracis  depends  on 
two  major  virulence  determinants:  'anthrax  toxin',  and  an  antiphagocytic  poly(D- 

glutamic  acid)  capsule,  each  encoded  by  a  separate  plasmid  (see  (7)  for  a  general 
review).  The  term  'anthrax  toxin'  refers  collectively  to  three  proteins  of  similar 
molecular  weight,  encoded  by  plasmid  pXOl:  Lethal  Factor  (LF;  776  a.a.).  Edema  Factor 
(EF;  767  a.a.),  and  Protective  Antigen  (PA;  735  a.a.).  The  proteins  lack  toxicity  when 
administered  individually  (8),  but  two  binary  combinations  are  toxic:  co-injection  of 
PA  with  EF  ("edema  toxin")  produces  edema  in  the  skin  of  animals  and  inhibits  the 
function  of  phagocytic  cells  (9),  while  the  combination  of  PA  and  LF  ("lethal  toxin") 
induces  death  (10,11).  The  shock  and  death  resulting  from  systemic  anthrax  are 
primarily  due  to  the  effects  of  the  lethal  toxin,  as  animals  treated  with  lethal  toxin 
show  symptoms  closely  resembling  those  characterizing  infection  with  the  bacillus 
(12,13,14). 

There  is  a  clear  need  for  an  improved  Anthrax  vaccine  for  human  use  (15-17): 
the  currently  licensed  vaccine  in  the  United  States,  consisting  of  a  cell-free  filtrate 

of  anthrax  absorbed  onto  aluminum  hydroxide,  requires  multiple  doses,  produces 
frequent  side-effects,  and  is  not  effective  against  certain  virulent  strains.  Much 

effort  has  been  directed  towards  the  goal  of  a  human  vaccine  that  is  fully  defined 
chemically,  contains  only  essential  ingredients,  confers  protection  in  a  single  dose, 
and  has  no  side-effects.  All  the  Anthrax  vaccines  currently  in  use  or  under 
development  contain  or  produce  Protective  Antigen.  Lethal  Factor  and  Edema  Factor 
have  also  demonstrated  potential  as  protective  immunogens,  whereas  structural 
components  such  as  capsule,  surface  polysaccharide  and  surface  proteins  have  not. 

The  work  described  here  aims  at  determining  the  atomic  resolution  structure 
of  the  anthrax  protective  antigen  in  its  monomeric  and  oligomeric  forms.  We 
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anticipate  that  the  three-dimensional  data  derived  will  be  beneficial  in  the 
development  of  the  next  generation  of  Anthrax  vaccines.  Our  structural  work, 
together  with  the  biochemical  and  genetic  work  of  our  collaborators,  should  allow  us 
to  define  the  domain  structures  of  PA  involved  in  receptor  binding.  Lethal  Factor 
binding,  membrane  insertion  and  translocation,  and  the  biochemical  basis  of  Lethal 
Toxin  lethality.  This  information  can  be  used  in  the  construction  of  recombinant 

domain  fragments  and  mutant  proteins  that  lack,  of  course,  many  of  the  key 
functions  of  the  parent  protein,  but  are  highly  immunogenic  and  have  improved 
therapeutic  properties. 

2.  Intoxication  by  Anthrax 

The  three  exotoxin  proteins  are  the  edema  factor  (EF),  the  lethal  factor  (LF), 

and  the  protective  antigen  (PA),  collectively  called  "anthrax  toxin"  (18).  In  the  "A-B" 

toxin  nomenclature,  PA  is  the  B  moiety  and  LF  and  EF  are  alternative  A  moieties. 
Anthrax  toxin  (in  common  with  certain  clostridial  toxins)  is  classified  as  a  binary  A- 
B  toxin  because  the  two  moieties  exist  as  separate  gene  products,  not  as  domains 

within  a  polypeptide  (19).  The  mature  anthrax  toxin  polypeptides  have  molecular 
weights  in  the  range  83-90kDa,  and  their  genes  have  been  cloned  and  sequenced  (20- 
23).  Mediating  the  effects  of  lethal  toxin  is  the  macrophage.  In  vitro,  only  cells  of  the 
macrophage  lineage  are  affected  by  exposure  to  lethal  toxin  (18);  high 

concentrations  lead  to  cytolysis  (18,25),  likely  mediated  by  the  overproduction  of 
reactive  free  radical  oxygen  intermediates  (26),  while  lower  concentrations  can 
induce  cytokine  overproduction  (24),  proposed  to  be  the  cause  of  the  systemic  shock 
and  death  associated  with  anthrax  (24).  In  a  recent  study,  Hanna  and  colleagues 
showed  that  mice  depleted  of  macrophages  are  resistant  to  lethal-toxin  challenge, 
and  regain  sensitivity  upon  injection  of  toxin-sensitive  macrophages  (24). 

The  events  that  occur  during  intoxication  of  the  cell  are  as  follows.  The  first 
step  involves  the  binding  of  PA  to  a  receptor  on  the  host  cell  surface  (12).  The 
receptor  is  found  in  a  variety  of  mammalian  cell  lines  (12)  and  appears  to  be  a 
membrane  protein  of  85-90  kDa  with  a  high  affinity  for  PA  (Kj  ~  1  nM)  (27).  Once 
bound  to  the  receptor,  PA  is  cleaved  by  a  cell-surface  protease  believed  to  be  furin 

(28,29),  a  ubiquitous,  subtilisin-like  endoprotease  (30).  Proteolysis  releases  an  N- 
terminal  20  kDa  fragment,  PA20.  from  the  cell  surface  (31)  and  exposes  a  high- 

affinity  binding  site  (Kd  =  10  pM)  (7)  for  LF  or  EF  on  the  63  kDa  fragment.  PA63»  still 
attached  to  the  receptor,  then  binds  LF  or  EF,  and  the  entire  complex  undergoes 

receptor-mediated  endocytosis  (32).  Acidification  of  the  vesicle  leads  to  insertion  of 
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PA63  iiito  the  endosomal  membrane  (7)  and  translocation  of  EF  or  LF  bilayer  into  the 
cytosol  where  they  exert  their  toxic  effects  -  adenylate  cyclase  activity  in  the  case  of 
EF  (33),  and  (probably)  proteolytic  activity  in  the  case  of  LF  (34). 

It  has  been  recently  shown  that  PA63  forms  a  heptameric,  ring-like  structure 
in  vitro  (see  below),  which  is  probably  the  membrane-inserting  form  in  vivo.  It  is 
not  yet  clear  if  oligomerization  occurs  at  the  cell  surface  or  after  acidification  of  the 
endosome,  and  the  precise  sequence  of  EF/LF  binding,  oligomerization  and 
membrane  insertion  has  yet  to  be  established. 

3.  The  PA^3  Oligomer 

Leppla  showed  that  following  proteolytic  cleavage  of  PA  by  trypsin  in  vitro, 
PA 63  associates  into  an  SDS-resistant  oligomeric  structure  which  binds  EF  and  LF 
with  high  affinity  (12).  Milne  et  al  have  recently  shown  that  such  an  oligomer  forms 
in  mammalian  cells  and,  judged  by  electron  microscopy,  is  a  heptamer  (35).  The 
oligomer  appears  as  a  compact  ring  made  of  seven  subunits.  Two  domains  can  be 
distinguished,  a  larger  one  forming  the  body  of  the  ring,  and  a  smaller  protrusion. 
The  outer  diameter  of  the  rings  is  roughly  140  A  including  the  protrusions  and  104  A 
excluding  them,  while  the  inner  diameter  is  20  A  (35).  In  both  artificial  lipid  bilayers 
(36-38)  and  cells  (39),  PA63  inserts  into  membranes  and  forms  ion-conductive 
channels.  Channel  formation  in  planar  lipid  bilayers  is  rapidly  accelerated  when  the 
pH  is  lowered  from  7.4  to  6.5  (36).  In  liposomes  the  formation  of  channels  occurs  at 
pH  6.0,  is  maximal  at  pH  4.7,  and  is  only  observed  in  the  presence  of  a  pH  gradient 
(39).  Although  the  precise  pH  requirements  vary  with  experimental  design,  channel 
formation  by  PA6  3  does  not  require  as  low  a  pH  as  does  that  by  the  B  chains  of 
diphtheria,  tetanus  and  botulinum  toxins  (40).  PA63  channels  appear  to  be  selective 
for  monovalent  cations  (36)  and  can  be  blocked  by  the  addition  of  LF  (41). 
Permeability  to  tetraheptylammonium  ions  indicates  that  the  channel  diameter  is  at 
least  12  A  (42-44),  consistent  with  the  electron  microscopy.  We  have  crystallized  the 
heptamer  in  a  water-soluble  form  at  high  pH,  and  have  begun  a  full  crystallographic 
analysis.  This  information  should  provide  further  data  on  those  parts  of  the  PA 
molecule  which  are  exposed  to  solvent,  and  therefore  further  map  the  immunogenic 
surfaces  of  the  molecule. 
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BODY 


1.  Crystallographic  analysis 

Data  Collection,  Model  Building  and  Refinement  of  PA 

Date  were  collected  on  our  in-house  system  (Rigaku  RU200  generator,  graphite 
monochromator  and  Xuong-Hamlin  multiwire  detector),  at  the  Stanford  Synchrotron 
(MarRessearch  Imaging  Plate)  and  at  the  Cornell  Synchrotron  (CCD  detector).  The 
structure  of  PA  was  determined  by  the  conventional  method  of  Multiple  Isomorphous 
Replacement  (MIR),  using  6  heavy  atom  derivative  (Table  1).  Most  of  the  work  was 
done  with  a  crystal  form  grown  from  PEG  1000;  towards  the  end  of  the  work  a  second 
crystal  form  was  employed,  grown  from  PEG  8000,  that  diffracted  to  higher  resolution 
(2.1  A).  Heavy  atom  positions  were  determined  by  inspection  of  Isomorphous 

Difference  Pattersons  and  Difference  Fouriers.  Heavy  atom  parameters  were  refined 
using  the  programs  HEAVY  (45)  and  MLPHARE  (46).  An  initial  model  was  built  into  an 

MIR  map  using  the  graphics  program  FRODO  (47).  The  initial  map  was  not  of  high 

quality,  but  various  residue  markers  allowed  for  some  stretches  of  the  chain  to  be 
assigned;  the  iodine  derivative  labeled  tyrosine  residues,  the  cysteine  point  mutants 
were  labeled  with  mercury,  and  the  compound  methylmercuric  nitrate  often  bound 
to  histidines.  The  large  tryptophan  sidechains  were  also  clearly  identified.  However, 
most  residues  were  initially  modeled  as  polyalanine.  The  density  corresponding  to  the 
stretch  from  Ser^^^  to  Leu^^l  was  particularly  well  defined,  and  tracing  was 

facilitated  by  the  presence  of  the  markers  Trp346^  Hg-labeled  His^^^  and  Hg-labeled 
Cys357  (in  crystals  of  the  point  mutant  T357C).  The  initial  model  consisted  of  10 
continuous  polypeptide  segments  accounting  for  524  residues  and  2843  non¬ 
hydrogen  atoms.  The  model  was  refined  against  the  15-2. 5A  native  data  collected  at 
the  Stanford  synchrotron  using  the  program  XPLOR  (48).  Simulated  annealing  (SA) 
was  performed  by  heating  the  model  to  1500  K  and  letting  it  cool  in  steps  of  50K  The 
crystallographic  R-factor  (fractional  discrepancy  between  observed  and  calculated 
model-derived  data)  after  refinement  was  0.437  for  all  reflections  with  Fq  >  2o.  Phases 
were  then  calculated  from  the  refined  atomic  coordinates  to  3.2  A  resolution  and 
recombined  with  the  original  MIR  phases  using  the  program  SIGMAA  (49).  The  new 
“recombined  map”  showed  a  marked  improvement  over  the  original  MIR  map.  New 
density  appeared,  allowing  for  further  residues  to  be  modeled.  Regions  of  density  into 
which  model  had  been  correctly  built  were  better  defined,  while  those  where  errors 
had  been  made  became  less  defined.  Model  in  the  poorer  regions  was  removed  and 


4 


Table  1:  Crystallographic  Statistics 
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(without  further  refinement)  new  phases  were  calculated  and  recombined  with  the 
MIR  phases.  In  this  "omit-recombined  map”,  the  correct  tracing  was  more  apparent. 
The  new  model  was  then  subjected  to  a  second  round  of  refinement,  which  reduced 

the  R-factor  to  0.419.  New  phases  were  calculated  to  2.5  A  from  this  refined  model  and 
were  combined  with  MIR  phases  previously  modified  and  extended  to  2.5  A  using  the 
program  SQUASH  (50).  This  cycle  of  model-building,  refinement  and  phase 
combination  was  iterated.  In  the  initial  rounds  a  conservative  strategy  was  adopted 
during  map  interpretation  by  deliberately  underestimating  the  electron  density.  Phe 
and  Trp  residues  were  placed  where  obvious,  density  consistent  with  medium-sized 
residues  was  modeled  as  Asn,  and  ambiguous  density  was  modeled  as  polyAla,  polyGly 
or  not  at  all.  After  round  7,  restrained  B  factor  refinement  was  performed  and 
higher  temperatures  were  used  in  the  simulated  annealing.  Maps  inspected  during 

the  model  building  phase  eventually  included  2Fo-Fc  maps,  positively  and  negatively 

contoured  Fq-Fq  maps,  recombined  maps,  omit-recombined  maps  and  SA-omit  maps, 
calculated  to  2.5  A  and  to  3.2  A.  Successive  rounds  of  the  refinement/phase 

combination  procedure  led  to  the  location  of  more  and  more  sequence  and  to  a 
gradually  improving  image  of  the  molecule.  The  C-terminus  was  located  quite  early, 
marked  by  Phe^27^  iodine-labeled  Tyr^32  uranyl-labeled  Glu^33 

At  this  stage  the  model  accounted  for  709  of  the  735  residues,  of  which  20  were 
modeled  as  glycine  or  alanine.  The  crystallographic  and  free  R-factors  to  2.5A  (Fq 

>2sigFo)  were  23.4  %  and  38.7%,  respectively  (no  water  molecules  were  placed  at  this 
stage).  Some  of  the  poorest  electron  density  was  in  the  interface  between  domains  I 
and  II  (residues  159  to  219),  so  that  the  placement  of  domain  I  with  respect  to  the  rest 
of  the  molecule  was  uncertain,  since  two  different  positions  related  by 

crystallographic  symmetry  were  feasible.  To  resolve  this  issue  we  pursued  a  different 
crystal  form  of  PA,  grown  from  PEG  8000  (15-20%),  50mM  citrate  pH  6.  The  space 
group  is  P2i2i2i,  with  a=101.A,  b=94.4A,  c=84.3A.  Suitable  flash  freezing  conditions 
were  determined:  15%  glycerol,  20%  PEG8K,  50mM  citrate.  These  crystals  diffract 
better  than  the  PEG  1000  form,  and  native  data  to  2.1  A  resolution  were  collected  from 
frozen  crystals  at  the  Cornell  Synchrotron.  The  structure  has  been  solved  by  the 

method  of  Molecular  Replacement,  using  the  atomic  model  of  the  first  crystal  form  as 
the  search  object  to  be  positioned  into  the  new  crystal.  This  newly  positioned  model 
has  been  refined,  using  simulated  annealing  with  the  program  XPLOR  to  the 

resolution  limit  of  the  data.  The  R-factor  is  currently  29%.  The  addition  of  water 
molecules  to  this  model  should  reduce  the  R-factor  to  an  acceptable  range  (18-22%). 
On  the  basis  of  this  new  model,  we  are  now  confident  of  the  domain  boundaries  and 
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the  positions  of  all  ordered  residues  in  our  model  of  the  Protective  Antigen  (720  out  of 
735  amino-acids). 

Cysteine  point  mutants 

Leppla  supplied  us  with  10  cysteine  point  mutants  (the  wild-type  protein 
contains  no  cysteines).  We  attempted  to  crystallize  these  mutants  and  study  crystals 
soaked  in  methylmercuric  nitrate,  initially  in  the  hope  that  mercury  would  bind  to 
cysteine  and  provide  three-dimensional  markers  facilitating  the  chain  trace.  In  two 
cases  this  was  successful,  and  the  data  are  summarized  in  Table  2.  This  information  is 
described  in  detail  since  these  mutants  may  be  used  by  our  collaborators  to  study  the 
process  of  oligomerization  and  membrane  insertion  via  spin-labeling.  Three  mutants 
failed  to  crystallize:  S248  is  buried,  S278  is  in  an  ordered  loop  that  is  involved  in  a 
crystal  contact,  and  S249  in  an  exposed  loop  that  forms  part  of  a  crystal  contact. 


Table  2: 

Cysteine 

point  mutants 

Mutant 

Crystallizes? 

Specific 

Hg- 

site? 

Environment 

of 

residue 

Residue 
found  in 

S170C 

not 

tested 

(not  visible) 

vicinity  of 
cleavage  site 

S248C 

- 

N/A 

partly  buried 

domain 

interface 

S278C 

- 

N/A 

crystal 

contact 

ordered  loop 

F313C 

+ 

- 

(not  visible) 

disordered  loop 
(302-24) 

T357C 

+ 

+ 

exposed 

P  strand 

S429C 

- 

N/A 

crystal 

contact 

loop 

T488C 

+ 

- 

bulled 

P  stiaiid 

T548C 

+ 

- 

ciystal 

contact 

loop 

S623C 

+ 

+ 

exposed 

P  strand 

S717C 

+ 

not 

tested 

exposed 

loop 
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Seven  mutants  crystallized;  five  were  of  sufficient  size  for  data  colection:  of  these 

five  three  did  not  produce  useful  derivatoves:  F313  is  in  a  disordered  loop,  T488  is 

buried  and  T548  is  in  an  ordered  loop  that  is  involved  in  a  crystal  contact.  Crystals  of 
only  two  mutants,  T357C  and  S623C,  were  able  to  bind  a  Hg  atom  in  an  ordered  fashion, 
and  our  model  demonstrates  that  residues  357  and  623  are  on  p -strands  exposed  to 
solvent.  These  studies  suggest  that  most  of  the  mutants  are  suitable  for  spin-label 

studies,  and  that  the  cysteines  are  available  for  derivatization  (but  either  fail  to 
crystallize  due  to  proximity  to  a  crystal  contact  or  are  too  mobile  to  be  seen 
crystallographically).  The  exceptions  are  S248,  found  at  the  domainl-2  interface;  and 
T488  which  is  buried. 

PA63  Oligomer 

We  have  determined  conditions  for  reproducibly  seeding  small  crystals  of 
PA63  in  order  to  produce  large  crystals.  The  crystals  have  been  fully  characterized, 
and  belong  to  space  group  P2i,  with  a=149A  b=14lA,  c=163A,  b=105^,  and  diffract  to 
about  4.5A  resolution.  The  crystal  s  are  very  radiaitin  sensitive,  so  we  have  detrmined 
conditions  for  flash-freezing  them  and  have  collected  a  complete  data  set  to  7.5A 
using  our  in-house  facilities,  and  have  begun  heavy  atom  derivative  searches.  From 
the  diffraction  data  it  is  possible  to  search  for  internal  symmetry  in  the  molecule 

before  any  heavy  atom  data  are  available.  We  have  done  this,  using  a  "self- rotation 
function",  and  find  that  the  PA63  molecule  has  a  7-fold  axis  of  symmetry,  and  we 
calculate  that  the  crystal  contains  about  70%  solvent.  This  information  confirms  that 
PA63  crystallizes  as  a  heptamer,  which  is  the  biologically  active  species  that  has  been 
observed  by  electron  microscopy  and  by  studies  of  infected  cells.. On  the  basis  of 
crystal  packing  analysis  and  electron  microscopy  measurements  (39),  we  have  built  a 
hypothetical  model  for  the  heptamer  (see  below). 

2.  Three-dimensional  Structure  of  the  protective  antigen  (PA) 

The  molecule  is  about  100  A  tali  by  50-70  A  wide  and  30-40  A  deep  (figure  1). 
The  sequence  with  secondary  structure  assignments  is  shown  in  figure  2.  It  consists 
of  four  domains  organized  predominantly  into  p -sheets,  with  only  a  few  short  helices 
(^4  turns  long).  The  domain  structures  are  discussed  in  detail  below.  Domains  1  and  2 
are  approximately  equal  in  size,  with  -250  residues  each.  Domain  1  contains  PA20>  Ihe 
fragment  removed  by  the  cell  surface  protease.  The  function  of  domain  2  is  not 
known,  but  it  may  play  a  role  in  membrane  insertion  and/or  oligomerization.  Domain 
3  is  the  smallest  of  the  domains  with  about  100  residues;  its  function  is  also  currently 
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Figure  1:  schematic  structure  of  the  Protective  Antigen 

unknown.  Domain  4  contains  the  C-terminal  140  residues  and  is  probably  the 
receptor-binding  domain. 

Domain  1 

Domain  1  consists  of  residues  1-249.  Of  these,  the  first  220  fold  into  13  p-strands 
arranged  in  an  antiparallel  fashion  in  2  sheets  of  unequal  size.  Protease  cleavage  on 
the  cell  surface  occurs  at  the  sequence  RKKR^^^  (51),  which  forms  an  exposed  loop 
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between  two  strands  of  the  larger  sheet.  Mutation  of  the  cleavage  site  to  prevent 
proteolysis  results  in  a  loss  of  toxic  activity  (52).  Residues  making  up  the  PA2  0 
fragment  form  a  p -sandwich  composed  of  the  small  sheet  and  6  of  the  9  strands  of  the 
large  sheet.  That  the  cleavage  site  does  not  occur  between  structural  domains  but 

within  a  domain  explains  why  it  is  possible  to  “nick”  PA  with  trypsin  without 
causing  dissociation  of  the  2  resulting  polypeptide  chains  (12).  The  remaining  3 

strands  of  the  large  sheet  (residues  190-220)  pack  against  the  last  30  residues  of  the 
domain,  which  seem  to  have  little  regular  secondary  structure.  Domain  1  can  thus  be 
viewed  as  having  2  subdomains:  (i)  PA20  .which  forms  a  4+6  stranded  b  sandwich,  and 
(ii)  the  remaining  residues  (168-249),  which  fold  into  a  3-stranded  sheet  and  a 
random  coil.  Removal  of  PA20  would  not  only  disrupt  hydrogen  bonding  interactions 
in  this  subdomain  by  tearing  the  large  p -sheet,  but  would  also  cause  several 
hydrophobic  residues  to  become  exposed.  Perhaps  this  subdomain  is  poised  to 
undergo  a  conformational  change  once  PA20  is  removed.  We  do  not  yet  know  what 
functional  role  the  residues  in  this  subdomain  play,  but  since  PA  cannot  bind  EF  or 
LF  until  PA20  is  removed,  they  may  be  involved  in  EF/LF  binding. 

Domain  2 

Domain  2  consists  of  residues  250-487  organized  into  a  9-stranded  P -barrel  and 
2  a -helices  located  at  either  end  of  the  barrel.  The  barrel  contains  three  very  long 

strands  of  contour  lengths  39,  45  and  54A,  and  a  large  loop  made  up  of  residues  304- 

321.  The  loop  contains  two  hydrophobic  residues  at  its  tip,  Phe-313  and  Phe-314,  and 
is  susceptible  to  proteolysis  by  chymotrypsin  (which  cleaves  after  aromatic  residues) 

(7,12).  As  in  the  case  of  the  trypsin  site  of  domain  1,  it  is  possible  to  nick  PA  with 

chymotrypsin  without  causing  dissociation  into  two  fragments  (12,51).  However,  this 

treatment  renders  the  PA  (in  combination  with  LF)  non-toxic  to  cells  (51),  despite  its 
ability  to  bind  to  LF  and  to  cell  surface  receptors,  and  to  undergo  receptor-mediated 
endocytosis  (51).  A  deletion  mutant  lacking  the  two  Phe  residues  was  found  to  be 

defective  in  its  ability  to  form  oligomers  (53).  Taken  together,  these  data  suggest  that 
the  loop  has  a  role  in  oligomerization  and  is  probably  involved  in  an  intermolecular 
contact.  A  complementary  surface  has  not  yet  been  identified,  but  a  strong  candidate 
is  an  exposed  hydrophobic  patch  located  on  the  surface  of  domain  3,  described  below. 
The  length  of  domain  2  (~50A)  would  permit  it  to  span  a  membrane. 
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Figure  2:  Sequence  of  PA  with  secondary  structure  assignments 

Residues  homologous  with  iota-toxin  are  underlined. 

Domain  3 

Domain  3  consists  of  residues  488-594  and  is  the  smallest  of  the  domains.  It 
consists  primarily  of  a  4-stranded  p -sheet  and  4  a -helices.  One  helix  is  part  of  a 
triangular  strand-helix-loop  structure  that  permits  5  hydrophobic  residues  to  form  a 
flat  surface  exposed  to  the  solvent,  constituting  a  “hydrophobic  patch”.  In  crystals  of 
PA  this  patch  is  occupied  by  a  phenylalanine  residue  from  a  neighbouring  molecule. 
It  is  this  interaction  in  the  crystal  that  leads  us  to  wonder  whether,  in  an  oligomer  of 
PA63,  the  hydrophobic  patch  of  one  monomer  might  be  occupied  by  Phe-313  or  -314 
from  domain  2  of  a  neighbouring  monomer.  Further  studies  are  required  to 
determine  the  significance  of  this  patch  and  the  functional  role  of  the  entire 
domain. 


Domain  4 


mutants  have  implicated  domain  4  in  receptor  binding  (54,55).  Deletion  of  seven 
residues  from  the  C-terminus  leads  to  reduced  toxicity,  while  deletion  of  12  or  14 
residues  leads  to  complete  loss  of  activity  and  sensitivity  to  proteases.  The  structure 
provides  a  rationale  for  these  data:  protein  truncated  at  residue  728  would  have  half 
of  the  outer  p -strand  of  a  p -sheet  missing;  truncation  at  residue  723  would  have  all  the 
final  strand  missing  and  severely  affect  the  conformation  and  rigidity  of  the 
preceding  loop  between  residues  704-722  which  extends  into  the  solvent  and  also 
forms  part  of  the  interface  with  domain  2. 

While  domains  1-3  appear  quite  intimately  associated,  domain  4  is  much  less  so. 
The  only  contacts  are  the  initial  p -hairpin  joining  domain  4  with  domain  3  and  a 
small  contact  surface  with  domain  2.  It  is  therefore  conceivable  that  during 
membrane  insertion  (or  perhaps  at  another  step)  domain  4  separates  altogether  from 
domain  2,  swinging  away  to  the  left  in  figure  1. 

Comparison  of  the  two  crystal  forms 

We  haye  begun  a  detailed  analysis  of  the  two  cystal  forms.  While  the  structures 
are  very  similar  in  most  respects  there  are  some  small  but  fascinating  differences. 
The  two  different  crystal  forms  were  obtained  at  different  values  of  pH,  the  first  at 
pH  7.5  and  the  second  at  pH  6.  Two  regions,  consisting  of  residues  340-355  and  511-517 
that  are  ordered  in  the  pH  7.5  crystal  form  appear  to  be  disordered  at  the  lower  pH.  In 
the  second  case,  a  group  of  hydrophobic  sidechains  that  was  buried  in  the  pH  7.5 
form  become  exposed  in  the  pH  6  form.  This  has  potential  biologically  significance, 
as  it  may  represent  a  step  in  the  pH-induced  transformation  from  a  water-soluble  to  a 
membrane-inserting  molecule.  We  believe  that  these  observations  warrant  further 
investigation  to  determine  whether  the  changes  we  see  are  truly  pH-induced  or 
simply  the  product  of  different  crystal  environments. 

Homology  with  lota-Ib  toxin 

PA  shares  34%  sequence  identity  with  the  B-chain  of  the  Clo  striduim 
perfringens  iota  toxin,  another  toxin  whose  A  and  B  parts  exist  as  separate 
polypeptides  (56).  Like  PA,  lota-Ib  is  produced  in  a  proform  that  is  proteolytically 
activated  and  internalized  by  receptor-mediated  endocytosis  (57,58).  lota-Ib  is 
believed  to  coordinate  the  delivery  to  the  cytoplasm  of  the  A  moiety,  iota-la,  which 
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Residue 

Figure  3:  Homology  between  PA  and  iota-toxin  (see  text) 


ADP-ribosylates  actin  (59-61)  .  Residues  in  PA  identical  to  those  in  iota-Ib  are 
underlined  in  figure  2.  We  assume  that  residues  performing  similar  functions  in  PA 
and  iota-Ib  should  exhibit  a  greater  degree  of  similarity  than  those  which  interact 
with  the  receptors  or  with  the  A  moieties,  since  these  ligands  differ  for  the  two 
toxins.  The  highest  degree  of  similarity  occurs  in  the  middle  of  the  sequence  (figures 
2  and  3):  domains  2  and  3  have  40  and  37%  sequence  identity,  while  domains  1  and  4 
have  29  and  21%  identity,  respectively.  Four  of  the  five  residues  making  up  the 
hydrophobic  patch  of  domain  3  have  identical  corresponding  residues  in  iota-Ib,  and 
the  fifth,  Phe-552,  corresponds  to  a  leucine.  Thus,  a  hydrophobic  patch  likely  also 
exists  in  iota-Ib.  In  general,  the  high  homology  of  domains  2  and  3  is  consistent  with 
a  putative  role  in  oligomerization  and  membrane  insertion. 

Absence  of  homology  is  better  illustrated  in  figure  3,  in  which  each  PA  residue 
is  assigned  a  score  based  on  similarity  to  the  corresponding  iota-Ib  residue.  A  score  of 
2  is  given  for  identical  residues,  1  for  very  similar  residues  (e.g.  ile  vs.  leu,  glu  vs. 
asp),  -2  for  residues  missing  in  the  iota-Ib  sequence,  and  0  for  all  others.  Three  low- 
scoring  regions,  indicated  by  arrows,  clearly  stand  out.  Two  stretehes  occur  in 
domain  4,  consistent  with  a  role  in  receptor  binding.  The  third  and  longest  stretch 
occurs  between  residues  194-204,  which  are  located  in  the  3-stranded  b-sheet  of 
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Figure  4a:  Hypothetical  model  of  PA63:  top  view  (see  text) 

domain  1  (at  the  interface  of  PA20  domain  2).  The  low  degree  of  similarity  in  this 
region  would  be  consistent  with  a  role  in  EF/LF  binding,  since  the  A  moieties  in  the 
two  toxins  show  no  homology, 

A  hypothetical  model  for  oligomerization  and  membrane  insertion 

We  have  studied  two  crystal  forms  of  PA,  one  obtained  at  pH  7.5  and  the  other  at 
pH  6.  The  structure  shown  in  figure  1  is  that  observed  at  pH  7.5,  and  closely  resembles 
the  low  pH  structure.  The  largest  structural  difference  occurs  in  residues  242-255, 
located  at  the  base  of  the  barrel  formed  by  domain  2.  This  region  contains  3  hydro- 


Figure  4b:  Hypothetical  model  of  PA63:  side  view  (see  text) 

phobic  residues,  a  Trp,  a  Leu  and  a  Met,  in  the  sequence  x4WxxxMxLx3,  where  x 
represents  a  hydrophilic  residue.  At  pH  7.5  these  three  residues  are  buried  within  the 
core  of  the  (3 -barrel  but  become  exposed  at  pH  6  in  a  disordered  loop.  We  have  not  yet 
established  that  this  change  is  pH-induced  rather  than  a  consequence  of  the  crystal 
lattice,  but  it  is  tempting  to  hypothesize  that  the  exposure  of  hydrophobic  residues 
represents  a  step  in  pH-induced  membrane  insertion.  Interestingly,  the 
corresponding  sequence  in  iota-Ib  (x4WxxxLxIx3)  displays  a  similar  pattern  of 
amphipathicity. 
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axis  of  7-fold  rotational  symmetry,  with  domain  2  likely  facing  the  channel  interior 
and  domains  3  and  4  facing  outward.  A  hypothetical  model  of  the  heptamer  is  shown 
in  figure  4.  The  loops  of  domain  2  implicated  in  oligomerization  face  the  exterior, 

such  that  an  interaction  with  the  hydrophobic  patch  of  domain  3  from  a 
neighbouring  molecule  is  feasible.  The  WxxxMxL  streteh  possibly  involved  in 

membrane  insertion  and  the  receptor-binding  domain  are  located  at  the  bottom  of 
the  heptamer  in  figure  4b,  and  the  residues  suspected  of  binding  EF  and  LF  are  at  the 
top.  We  therefore  imagine  the  membrane  to  be  oriented  horizontally  at  the  bottom  of 
figure  4b,  such  that  membrane  insertion,  possibly  accompanied  by  lateral  separation 
of  the  receptor-binding  domain,  would  leave  the  putative  EF/LF  binding  site  exposed 

to  the  extracellular  environment,  consistent  with  the  ability  of  EF/LF  to  block  the 
conductance  of  membrane  channels  formed  by  PA63  (41). 

We  repeat  that  this  is  still  a  hypothetical  model,  and  that  confirmation 

requires  a  full  crystallographic  analysis  of  the  heptamer.  We  have  obtained  crystals 
that  diffract  to  4.5A  resolution,  so  that,  together  with  the  high  resolution  structure  of 
the  monomeric  PA,  we  should  be  able  to  build  an  accurate  model  of  the  heptamer  in 
the  near  future. 


CONCLUSIONS 

The  work  described  in  this  report  defines  for  the  first  time  the  atomic 
resolution  structure  of  the  major  antigenic  component  of  the  anthrax  toxin,  the 
'protective  antigen',  revealing  that  the  molecule  comprises  four  distinct  domains. 
Together  with  existing  biochemical  and  genetic  data,  we  can  now  begin  to  assign 
functions  to  each  domain,  although  some  of  these  assignments  must  remain 
speculative  until  the  crystal  structure  of  the  oligomeric  PA63  has  been  determined. 

A  number  of  suggestions  for  directions  to  an  improved  vaccine  immediately 
suggest  themselves.  For  example,  our  crystallographic  results  and  the  biochemical 
data  from  other  groups  lead  to  the  assignment  of  Domain  4  as  the  'receptor-binding' 
domain;  our  precise  definition  of  the  residues  comprising  this  domain  will  allow  it  to 
be  expressed  as  a  recombinant  protein  and  tested  for  its  immunogenic  and  protective 
properties.  The  sequence  between  residues  310-324  forms  a  large  disordered  loop  that 
may  be  required  for  oligomerization  to  form  PA63.  Mutations  in  this  loop  that 
prevent  oligomerization  could  produce  a  defective  PA  molecule  that  was  otherwise 
normally  immunogenic  (53).  Our  data  also  suggest  a  range  of  mutations  that  can  be 
used  to  test  the  functional  role  of  other  residues  on  the  protein  surface. 

The  work  on  the  full-length  protective  antigen  molecule  is  now  nearly 
complete,  ahead  of  schedule  from  our  original  estimate.  We  will  now  focus  on  the 
structure  of  the  oligomeric  PA63  molecule,  which  is  progressing  on  schedule.  We  will 
also  take  this  opportunity  to  extend  the  work  to  the  Lethal  Factor  (LF)  protein.  We 
have  crystals  of  the  full  length  LF  that  diffract  to  better  than  3A;  also  the  N-terminal 
domain  of  LF  (LFjsf)  is  available  as  a  recombinant  fragment,  and  we  will  attempt 
crystallization  of  this  fragment,  both  by  itself  and  in  complex  with  PA63,  allowing  us 
to  define  the  molecular  interface  between  PA63  and  LF.  The  success  of  chimeric  LF 
molecules  in  translocating  foreign  molecules  into  the  cytoplasm  suggests  that  this 
information  will  be  also  of  use  in  the  generation  of  targeted  toxins  (62). 
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